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Primary Structure and Translation of a Defective Interfering RNA of Murine Coronavirus 
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An intracellular defective-interfering (Dl) RNA, DIssE, of mouse hepatitis virus (MHV) obtained after serial high multi¬ 
plicity passage of the virus was cloned and sequenced. DIssE RNA is composed of three noncontiguous genomic 
regions, representing the first 864 nucleotides of the 5' end, an internal 748 nucleotides of the polymerase gene, and 
601 nucleotides from the 3' end of the parental MHV genome. The DIssE sequence contains one large continuous open 
reading frame. Two protein products from this open reading frame were identified both by in vitro translation and in Dl- 
infected cells. Sequence comparison of DIssE and the corresponding parts of the parental virus genome revealed that 
DIssE had three base substitutions within the leader sequence and also a deletion of nine nucleotides located at the 
junction of the leader and the remaining genomic sequence. The 5' end of DIssE RNA was heterogeneous with respect 
to the number of UCUAA repeats within the leader sequence. The parental MHV genomic RNA appears to have exten¬ 
sive and stable secondary structures at the regions where Dl RNA rearrangements occurred. These data suggest that 
MHV Dl RNA may have been generated as a result of the discontinuous and nonprocessive manner of MHV RNA 
synthesis. © 1988 Academic Press, Inc. 


INTRODUCTION 

Mouse hepatitis virus (MHV), a member of the Coro- 
naviridae, contains a single-stranded, positive-sense 
RNA of approximately 6 X 10® Da (Lai and Stohlman, 
1978; Wege era/., 1978). In infected cells, the genomic 
RNA of MHV is first translated into an RNA-dependent 
RNA polymerase (Brayton et a/., 1982, 1984; Mahy et 
al., 1983) which is responsible for the synthesis of a 
genomic-sized negative-stranded RNA (Lai et al., 
1982b). The negative-stranded RNA then serves as the 
template for the synthesis of six subgenomic and a 
genomic-sized mRNA (Brayton et al., 1984; Lai et al., 
1982b). These mRNAs are arranged in the form of a 
3'coterminal "nested" set, i.e., the sequence of each 
mRNA is contained entirely within the next larger 
mRNA (Lai era/., 1981; Leibowitz era/., 1981). In addi¬ 
tion, each mRNA has a common leader sequence, 
which is derived from the 5' end of the genome (Lai et 
at., 1982a, 1983, 1984; Spaan era/., 1983). Several 
pieces of evidence demonstrated that MHV utilizes a 
novel mechanism of leader RNA-primed transcription, 
in which a free leader RNA species derived from the 
5' end of genomic RNA is utilized as a primer for the 
transcription of subgenomic mRNAs (Baric era/., 1983, 
1985; Makino eta/., 1986b). 

Another unusual feature of coronavirus RNA synthe¬ 
sis is that the virus undergoes RNA-RNA recombina¬ 
tion at a very high frequency (Makino era/., 1986a). The 
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unusually high frequency, approaching 10% under 
some circumstances (Makino et al., 1986a), of coro¬ 
navirus RNA recombination suggests that discontinu¬ 
ous RNA transcripts might be generated during coro¬ 
navirus RNA synthesis. These incomplete RNA inter¬ 
mediates may rejoin the original or different RNA 
template to continue RNA synthesis, resulting in RNA 
recombination in the latter case. The detection of such 
RNA intermediates in MHV-infected cells (Baric et a/., 
1985, 1987) suggests that coronavirus genomic RNA 
synthesis involves a discontinuous and nonprocessive 
mechanism, which may account for the high frequency 
of recombination via a copy choice mechanism. 

Defective-interfering (Dl) particles are naturally oc- 
curing deletion mutants that have been described for 
many virus groups. Characteristically, Dl particles (a) 
lack part of the viral genome, (b) contain normal viral 
structural proteins, (c) replicate only with the aid of a 
helper standard virus, and (d) interfere with replication 
of homologous standard virus. Deletion of genomic se¬ 
quence can occur in various regions of the genome; 
however, all of the Dl RNAs apparently retain signals 
for RNA replication since they can be replicated in the 
presence of helper virus. The generation of Dl RNA can 
be viewed as the result of abnormal RNA replication or 
illigitimate RNA recombination. Therefore, the struc¬ 
ture of Dl RNA is of particular interest in elucidation of 
the mechanism of viral RNA replication and recombina¬ 
tion. 

We have previously reported the generation of Dl 
particles during high multiplicity passages of the JHM 
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Fig. 1. Intracellular RNA species in Dl-infected cells. 32 P-Labeled 
RNA from MHV-JHM-infected cells (a) and Dl particles-infected cells 
(b) were electrophoresed in a 1% agarose gel without denaturation. 
Numbers 1, 2, 3, 6, and 7 represent the major MHV-JHM-specific 
mRNA species. 


strain of MHV (MHV-JHM) (Makino etal., 1984a). In Dl- 
infected cells, the synthesis of most of the standard 
viral mRNAs is inhibited. Instead, three distinct virus- 
specific RNA species could be detected (Makino etal., 
1985) (Fig. 1). The first species, DIssA, is equivalent to 
Dl virion RNA in length and is eventually incorporated 
into virus particles. This RNA differs from the standard 
virus genome in that it contains multiple deletions dis¬ 
tributed throughout the genome, except for the 5' and 
3' ends of the genomic RNA (Makino et at., 1985), 
which encode RNA polymerase (gene A) and nucleo- 
capsid (N) protein, respectively. Surprisingly, DIssA 
RNA can replicate by itself in the absence of helper vi¬ 
rus infection, suggesting that DIssA codes for func¬ 
tional RNA polymerases (Makino et at, 1988). Thus, 
DIssA is not a defective RNA in a strict sense. The sec¬ 
ond major RNA species found in Dl-infected cells is in¬ 
distinguishable from the mRNA 7 made by the standard 
virus. The synthesis of this mRNA and its product N 
protein is not inhibited in Dl-infected cells. The third 
RNA species is a novel single-stranded polyadenylated 
Dl RNA species of varying size. Oligonucleotide fin¬ 
gerprinting studies suggest that it represents se¬ 


quences derived from various noncontiguous parts of 
the genome. The size of this RNA varies with the Dl 
passage level (Makino et at, 1985). One of these 
RNAs, DIssE, which is the smallest Dl RNA detected, 
has been analyzed in greater detail (Makino et at, 
1988). In contrast to DIssA, DIssE RNA synthesis re¬ 
quires helper virus coinfection (Makino et at, 1988). 
Only a trace amount of it is incorporated into virus parti¬ 
cles to serve as a template for RNA replication (Makino 
et at, 1988). Thus, it may lack packaging signals. On 
the other hand, since it is efficiently replicated in Dl- 
infected cells, DIssE RNA must contain the sequences 
essential for viral RNA replication. 

In the present study, we analyzed the primary struc¬ 
ture of DIssE RNA. The results revealed that DIssE con¬ 
sists of three noncontiguous regions of MHV-JHM ge¬ 
nomic RNA, including 5' end leader RNA and the 3' end 
of genomic RNA. One large open reading frame (ORF) 
was demonstrated and the product of this ORF was 
identified both in infected cells and by in vitro transla¬ 
tion. Possible mechanisms of Dl RNA generation are 
discussed. 

MATERIALS AND METHODS 
Viruses and cell culture 

MHV-JHM was used as a nondefective standard vi¬ 
rus. Serially passaged MHV-JHM stock at passage 
level 17 was used as the source of Dl particles (Makino 
et at, 1985). All viruses were propagated in DBT cells 
as described previously (Makino et at, 1984a). 

Preparation of virus-specific intracellular RNA 

MHV-specific intracellular RNA was extracted by 
procedures described previously (Makino et at, 
1984b). Poly(A)-containing RNA was obtained by oli- 
go(dT)-cellulose column chromatography (Makino et 
al., 1984b). 

Agarose gel electrophoresis 

32 P-Labeled virus-specific RNA was analyzed by 
electrophoresis on 1% agarose gels without dena¬ 
turing as described previously (Makino et at, 1988). 
Poly(A)-containing RNA was purified by preparative gel 
electrophoresis in 1 % urea-agarose gels as previously 
described (Makino et at, 1984a). The RNA was eluted 
from gel slices by the methods of Langridge et at, 
(1980). 

cDNA cloning of DIssE 

cDNA cloning followed the general method of Gubler 
and Hoffman (1983). Five hundred nanograms of oli- 
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go(dT) 12 _ 18 was mixed with 2 nQ of gel-purified DIssE 
RNA in 10 n\ of distilled water. The RNA and oligo(dT) 
mixture was heated at 70° for 3 min and chilled quickly. 
The RNA-DNA hybrid was then incubated in 50 /tl of 
first-strand cDNA synthesis buffer containing 60 units 
of RNasin (Promega Biotec), 50 mM Tris-HCI (pH 8.3 
at 42°), 100 m M KCI, 10 mM MgCI 2 , 10 mM DTT, 1.25 
mM each of dATP, dCTP, dGTP, and TTP, and 20 units 
of avian myeloblastosis virus reverse transcriptase (Life 
Science) at 42° for 1 hr. The cDNA synthesis was 
stopped by adding 4.4 fi\ of 250 mM EDTA. Nucleic 
acids were extracted with phenol-chloroform-isoamyl 
alcohol and precipitated with ethanol. 

Second-strand synthesis was carried out in a reac¬ 
tion volume of 100 /x I containing 20 mM Tris-HCI (pH 
7.4), 5 mM MgCI 2 , 100 mM KCI, 50 ftg/ml of BSA, 10 
m/W (NH 4 ) 2 S0 4 , 0.15 mM/3-NAD, 100 nM dNTPs, 25 
units of DNA polymerase I, 2 units of Escherichia coli 
DNA ligase, 0.8 units of RNase H, and the product from 
the first strand reaction. The mixture was incubated at 
12° for 1 hr, and then at 22° for 1 hr. The reaction was 
stopped by adding 8.7 fi\ of 250 m/W EDTA, and prod¬ 
ucts were extracted with phenol-chloroform-isoamyl 
alcohol, and precipitated with ethanol. Double- 
stranded DNA was dC-tailed in a 12-^1 reaction mixture 
containing 10 units of terminal transferase, 200 m/W 
potassium cacodylate, 0.5 m/W CoCI 2 , 25 mMTris- 
HCI (pH 6.9), 2 mM DTT, 250 M g/ml BSA, and 50 nM 
dCTP at 37° for 4 min. The dC-tailed double-stranded 
DNA was annealed to 200 ng of dG-tailed Psfl-cut 
pBR322 plasmid in 20 fA of a buffer containing 10 mM 
Tris-HCI (pH 7.4), 100 mM NaCI, and 0.25 mM EDTA. 
The DNA mixture was heated at 68° for 5 min and then 
cooled slowly overnight for annealing. The annealed 
molecules were used to transform E. coli MCI061 as 
described (Dagert and Ehrlich, 1979). 

Identification of large cDNA clones containing DIssE 
sequence 

32 P-Labeled MHV-JHM gene A cDNA clones C96 
and F82 (Shieh etai, 1987) and 5' end 32 P-labeled lead¬ 
er-specific 72-mer derived from leader sequence of 
MHV (Lai et al., 1984) were used for colony hybridiza¬ 
tion (Shieh et al., 1987) to isolate DIssE-specific cDNA 
clones. Colonies yielding a strong signal were further 
analyzed by Southern hybridization (Maniatis et al., 
1982). 

Primer extension 

The gel-purified RNAs were incubated in 8 fi\ of dis¬ 
tilled water containing 10 mM methyl mercury. After 10 
min incubation at room temperature, RNA was incu¬ 


bated in 50 mI of first-strand cDNA synthesis buffer with 
28 mM /8-mercaptoethanol and 5' end-labeled oligo- 
deoxyribonucleotides at 42° for 1 hr. Reaction prod¬ 
ucts were extracted with phenol-chloroform-isoamyl 
alcohol, precipitated with ethanol, and analyzed by 
electrophoresis on 6% polyacrylamide gels containing 
8.3 M urea and were eluted from the gels according to 
the published procedures (Maxam and Gilbert, 1980). 

DNA sequencing 

Sequencing was carried out by Sanger’s dideoxyri- 
bonucleotide chain termination method (Sanger et al., 
1977) and Maxam-Gilbert chemical modification pro¬ 
cedure (Maxam and Gilbert, 1980), as described pre¬ 
viously (Soe et al., 1987). Sequence analysis and pre¬ 
dicted RNA secondary structures were obtained with 
the Intelligenetics sequencing program. 

In vitro translation 

An mRNA-dependent rabbit reticulocyte lysate (New 
England Nuclear) was used as previously described 
(S oeetal., 1987). 

Antisera 

A monoclonal antibody, J.3.3, directed against the 
MHV-JHM N protein has been described (Fleming et 
al., 1983). The anti-p28 antibody was generated in rab¬ 
bits against a synthetic peptide representing a portion 
of the MHV-JHM p28 protein (Soe et al., 1987) and will 
be described in detail elsewhere (S. C. Baker et al., 
manuscript in preparation). 

Labeling of intracellular proteins, 
immunoprecipitation, and SDS-polyacrylamide 
gel electrophoresis 

DBT cells were infected with either wild type MHV- 
JHM or MHV-JHM containing Dl particles at 2 PFU per 
cell. At 7.5 hr postinfection, cells were labeled in methi¬ 
onine-free medium containing 30 nC\ of L-[ 35 S]methio- 
nine/ml (ICN translabel) for 30 min. Cell extracts were 
prepared by treatment with lysolecithin (L-a-lysophos- 
phatidylcholine, palmitoyl; Sigma) at 125 /*g/ml for 1 
min at 4°. The treated cells were scraped in 300 /*1 HND 
buffer (0.1 M HEPES, pH 8.0, 0.2 M NH 4 CI, 0.005 M 
DTT), disrupted by pipetting with a Pastuer pipet, and 
then centrifuged at 800 g for 5 min to remove nuclei 
and cell debris. The resulting supernatant was used for 
immunoprecipitation. 

Immunoprecipitation was performed by the methods 
of Kessler (1981). The cell-free extracts were incubated 
with 3 ^1 of antisera for 4 hr at 4°. The antigen-antibody 
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complexes were collected by binding to Pansorbin 
(Calbiochem, La Jolla, CA) and washed three times 
with washing buffer(50 m/WTris-HCI, pH 7.4, 150 m M 
NaCI, 5 m M EDTA, and 0.5% NP-40) and eluted by boil¬ 
ing for 2 min in electrophoresis sample buffer (0.1 M@- 
mercaptoethanol, 1% SDS, 0.08 /WTris-HCI, pH 6.8, 
and 10% glycerol). The bacteria were removed by cen¬ 
trifugation and proteins were analyzed by electrophore¬ 
sis on 5 to 15% SDS-polyacrylamide gels (Laemmli, 
1970). 

RESULTS 

cDNA cloning and sequencing of DIssE RNA 

To understand the primary structure of DIssE RNA, 
DIssE-specific cDNA clones were generated accord¬ 
ing to the general method of Gubler and Hoffman 
(1983), using oligo(dT) as a primer and gel-purified 
DIssE RNA. Since previous oligonucleotide fingerprint¬ 
ing analysis suggested that DIssE RNA contains the 
leader sequence and the 5' end region of genomic se¬ 
quence (Makino et al., 1985), cDNA clones were 
screened by colony hybridization using 5' end-labeled, 
leader-specific 72-mer, and two cDNA clones F82 and 
C96, which correspond to the 5' end of genomic RNA 
of MHV-JHM (Shieh et a!., 1987). Several large cDNA 
clones were isolated and their structure was further an¬ 
alyzed. A diagram representing the structure of the 
DIssE genome and that of MHV-JHM genomic RNA 
and the strategy used for sequencing the cDNA clones 
are shown in Fig. 2. The DIssE sequence obtained is 
shown in Fig. 3. 

Sequence analysis of DIssE cDNA clones revealed 
that DIssE RNA consists of three different regions of 
MHV-JHM genomic RNA. The first region represents 
864 nucleotides from the 5' end of the genomic RNA. 
The second region, 748 nucleotides in length, is a re¬ 
gion within the polymerase gene that corresponds to 
the region at 3.3 to 4 kb from the 5' end of genomic 
RNA (Shieh, unpublished observation), and the third re¬ 
gion contains a sequence of 601 nucleotides derived 
from the extreme 3' end of the genomic RNA. The entire 
sequence of DIssE RNA is identical to that of the corre¬ 
sponding regions of MHV genomic RNA (Skinner and 
Siddell, 1983; Soe et at., 1987; Shieh et a!., unpub¬ 
lished data), with some exceptions in the leader se¬ 
quence region (see below). 

The cDNA clones obtained does not appear to have 
a complete sequence at its extreme 5' end. To under¬ 
stand the complete 5' end sequence of DIssE, we per¬ 
formed primer-extension studies on DIssE RNA using 
a specific primer (5'-AATGTCAGCACTATGACA-3') 
complementary to nucleotides 123-140 from the 5' 


end of the genome of MHV-JHM (Shieh et al., 1987). 
The 5' end-labeled primer was hybridized to gel-purified 
DIssE RNA and extended with reverse transcriptase. 
Primer extension products were then analyzed by elec¬ 
trophoresis on 6% polyacrylamide gels containing 8 M 
urea. As shown in Fig. 4A, two cDNA products of 136 
and 131 nucleotides were obtained, indicating hetero¬ 
geneity at the 5' end sequence of DIssE. These primer- 
extended products were sequenced by the Maxam- 
Gilbert method. The sequences of both cDNA prod¬ 
ucts were identical except that the faster migrating 
cDNA products contained three UCUAA repeats at the 
3' end of the leader sequence, while the slower migrat¬ 
ing species contained four UCUAA repeats (Fig. 4B). In 
addition, the 5'end sequences of DIssE and MHV-JHM 
genomic RNA showed several differences. Within the 
leader sequence, 3 bases were substituted in DIssE 
RNA (Fig. 4B, asterisks) and nine nucleotides (UUUAU- 
AAAC) were deleted in DIssE at the junction between 
the leader RNA and the remaining genomic se¬ 
quences. The significance of the heterogeneity in the 
number of UCUAA repeats and of the nine-nucleotide 
deletion will be discussed below. 

Translation of DIssE RNA in vitro and in vivo 

Another significant feature of DIssE RNA is the pres¬ 
ence of a single large ORF (Fig. 3). This ORF is ex¬ 
pected to share amino acid sequence identity with 
three different regions of the standard MHV-JHM. The 
first 218 amino acids correspond to the N terminus of 
the MHV polymerase. This region represents the part 
of the N-terminus of the polymerase protein which is 
cleaved into a p28 protein (Denison and Perlman, 
1986; Soe et al., 1987). The following 250 amino acids 
were derived from the region of the polymerase at 3.3 
to 4 kb from the 5' end of the genome. The 3' end region 
of the ORF of DIssE RNA is the same as the ORF uti¬ 
lized forthe N protein (Skinner and Siddell, 1983). Thus, 
the predicted product of this ORF should contain the 
N-terminus of p28 and the C-terminus of the N protein. 
The predicted molecularweight mass of this ORF prod¬ 
uct is 62,538. 

To examine whether the ORF of DIssE RNA is utilized 
for translation, we first performed in vitro translation in 
a rabbit reticulocyte lysate of DIssE RNA purified from 
the Dl-infected cells. Two proteins with an apparent 
molecular mass of approximately 88,000 (88K) and 
79,000 (79K) were detected (Fig. 5A). Both proteins 
were immunoprecipitated with anti-N protein mono¬ 
clonal antibody and anti-p28 antibody (Fig. 5A, lanes 2 
and 3). Therefore, these two proteins were likely the 
translation products of DIssE RNA. A minor band of ap- 
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Fig. 2. Diagram of the structure of DIssE RNA and the strategy used for sequencing DIssE cDNA clones, (a) A comparison between the 
sequence of DIssE RNA and that of the standard MHV-JHM genomic RNA. A-G represent the seven genes of MHV (Lai et al., 1981). (b) Structure 
of DIssE-specific cDNA clones, (c) Strategy for sequencing of DIssE. Arrows starting with solid circles indicate DNA sequenced by the Maxam- 
Gilbert chemical method with 3' end-labeled DNA. Arrows starting with open circles indicate DNA sequenced by the dideoxy method. 


proximately 60 kDa had the same electrophoretic mo¬ 
bility as the N protein of MHV-JHM, and was precipi¬ 
tated with anti-N monoclonal antibody, but not with 
anti-p28 antibody (Fig 5A, lanes 2 and 3). Thus, this 
protein is most likely the N protein translated from the 
contaminated mRNA 7 in the DIssE RNA preparation. 

The synthesis of DIssE-specific protein in Dl-infected 
cells was then examined. DBT cells were mock-in¬ 
fected (Fig. 5B, lanes 1 and 4), infected with MHV-JHM 
(Fig. 5B, lanes 2 and 5), or infected with MHV-JHM con¬ 
taining Di particles (Fig. 5B, lanes 3 and 6). Both 88K 
and 79K proteins were specifically immunoprecipitated 
with anti-N monoclonal antibody and anti-p28 antibody 
from Dl-infected cells. The amount of these two pro¬ 
teins was low as compared to the N protein. Neverthe¬ 
less, they were reproducibly detected in Dl-infected 
cells. Thus, the DIssE RNA is a functional mRNA. The 
relationship between the two protein species detected 
is not clear. The discrepancy between the predicted 
and obsen/ed molecular weights of the translation 


products of DIssE could be due to post-translational 
modification of the protein or aberrant migration of the 
protein. A small amount of p28 was immunoprecipi¬ 
tated with anti-p28 antibody in MHV-JHM-infected cells 
(Fig. 5B, lane 5). However, this protein was hardly de¬ 
tectable in Dl-infected cells (Fig. 5B, lane 6). The ab¬ 
sence of detectable amount of p28 in Dl-infected cells 
may be due to the inhibition of MHV-JHM genomic RNA 
synthesis by DI particles (Makino et al,, 1985). 

Possible secondary structure at the DI RNA 
rearrangment sites 

Sequence analysis revealed that DIssE RNA con¬ 
sisted of three noncontiguous regions of MHV-JHM ge¬ 
nomic RNA. We have previously proposed that coro- 
navirus RNA synthesis proceeds by a discontinuous, 
nonprocessive mechanism, being interrupted at sites 
with hairpin loops (Baric et al., 1987). This transcrip¬ 
tional interruption could account for the generation of 
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5 1 -TATAAGAGTGAATGGCGTCCGTACGTACCCAATCTACTCTAAAACTCTTGTAGTTTAAATCTAATCTAATCTAATCTAAACGGC 8 4 

MPVGLVLSZ 

8 5 ACTTCCTGCGTGTCCATGCCCGTGGGCCTGGTCTTGTC ATAGTGCXGACATTTGTGGTTCCTTGACTTTCTGTCTCTGCCAGTG 168 

1 MAKMGKYGLGFKWA 14 

169 ACGTGTCCATTCGGCGCCAGCAGCCCACCCATAGGTTGCATAATGGCAAAGATGGGCAAATACGGTCTCGGCTTCAAATGGGCC 252 

15 PEFPWMLPNASEKLGNPERSEEDGFCPS 42 

253 CCAGAATTTCCATGGATGCTTCCGAACGCATCGGAGAAGTTGGGTAACCCTGAGAGGTCAGAGGAGGATGGGTTTTGCCCCTCT 336 

43 AAQEPKVKGKTLVNHVRVDCSRLPALEC 70 

337 GCTGCGCAAGAACCGAAAGTTAAAGGAAAAACTTTGGTTAATCACGTGAGGGTGGATTGTAGCCGGCTTCCAGCCTTGGAGTGC 420 

71 CVQSAIXRDIFVDEDPQKVEASTMMAEQ 98 

421 TGTGTTCAGTCCGCCATAATCCGTGATATTTTTGTTGACGAGGATCCCCAGAAGGTGGAGGCCTCGACTATGATGGCATTGCAG 504 

99 FGSAVLVKPSKRLSVQAWAKLGVLPKTP 126 

505 TTCGGTAGTGCTGTCTTGGTCAAGCCATCCAAGCGCTTGTCTGTTCAGGCATGGGCTAAGTTGGGTGTGCTGCCTAAAACTCCG 588 

127 AMGLFKRFCLCNTRECVCDAHVAFQLFT 154 

589 GCCATGGGGTTGTTCAAGCGCTTCTGCCTGTGTAACACCAGGGAGTGCGTTTGTGACGCCCACGTGGCCTTTCAACTTTTTACG 672 

155 VQPDGVCLGNGRFIGWFVPVTAIPEYAK 182 

673 GTCCAGCCCGATGGTGTATGCCTGGGTAACGGCCGTTTTATAGGCTGGTTCGTTCCAGTCACAGCCATACCGGAGTATGCGAAG 756 

183 QWLQPWSILLRKGGNKGSVTSGHFRRAV 210 

757 CAGTGGTTGCAACCCTGGTCCATCCTTCTTCGTAAGGGTGGTAACAAAGGGTCTGTGACATCCGGCCATTCCCGCCGCGCTGTT 840 

211 TMPVYDFNATDVVYADENQDDDADDPVV 238 

841 ACCATGCCTGTGTATGACTTTAATGCAACAGATGTTGTATATGC AGATGAAAACCAAGATGATGATGCTGACGATCCTGTAGTC 924 

▲ 

239 LVADTQEEDGVAREQVDSADSEICVAHT 266 

925 CTTGTCGCCGATACCCAAGAAGAGGACGGCGTTGCCAGGGAGCAGGTTGArtCGGCTGATTCGGAAATTTGTGTTGCGCACACT 1008 

267 GGQEMTEPDVVGSQTPIASAEETEVGEA 294 

1009 GTTGGTCAAGAAATGACTGAGCCTGATGTCGTCGGATCTC AAACTCCCATCGCCTCTGCTGAGGAAACCGAAGTCGGTGAGGCA 1092 

295 CDREGIAEVKATVCADALDACPDQVEAF 322 

1093 TGCGACAGGGAAGGGATTGCTGAGGTCAAGGCAACTGTGTGTGCTGATGCTTTAGATGCCTGCCCCGATCAAGTGGAGGCATTT 1176 

323 DIEKVEDSILSELQTELNAPADKTYEDV 350 

1177 GATATTGAAAAGGTTGAAGACAGTATCTTAAGTGAGCTTCAAACCGAACTTAATGCGCCCGCGGACAAGACCTATGAGGATGTC 1260 

351 LAFDAIYSETLSAFYAVPSDETHFKVCG 378 

1261 TTGGCATTCGATGCCATATACTCAGAGACGTTGTCTGCATTCTATGCTGTGCCGAGTGATGAGACGCACTTTAAAGTGTGTGGA 1344 

379 FYSPAIERTNCWLRSTLIVMQSLPLEFK 406 

1345 TTCTATTCGCCAGCTATAGAGGCTACTAATTGTTGGCTGCGTTCTACTTTGATAGTAATGCAGAGTTTACCTTTGGAATTTAAA 1428 

407 DLGMQKLWLSYKAGYDQCFVDKLVKSAP 434 

1429 GACTTGGGGATGCAAAAGCTCTGGTTGTCTTACAAGGCTGGCTATGATCAATGCTTTGTGGACAAACTAGTTAAGAGCGCGCCC 1512 

435 KSIILPQGGYVADFAYFFLSQCSFKVHA 462 

1513 AAGTCTATTATTCTTCCACAAGGTGGCTATGTGGC AGATTTTGCCTATTTTTTCCTAAGCCAGTGTAGCTTCAAAGTTCATGCT 1596 

463 NWRCLKRFDSTLPGFETIMKVLNENLNA 490 

1597 AACTGGCGTTGTCTAAAGAGATTTGATAGTACTCTACCTGGTTTTGAGACTATCATGAAAGTGTTGAATGAGAATTTGAATGCC 1680 

▲ 

491 YQNQDGGADVVSPKPQRKRGTKQKAQKD 518 

1681 TACCAGAATCAAGATGGTGGTGCAGATGTAGTGAGCCCTAAGCCTCAGAGAAAGAGAGGGACAAAGCAAAAGGCTCAGAAAGAT 1764 

519 EVDNVSVAKPKSSVQRNVSRELTPEDRS 546 

1765 GAAGTAGATAATGTAAGCGTTGCAAAGCCCAAAAGCTCTGTGCAGCGAAATGTAAGTAGAGAGTTAACCCCTGAGGATCGCAGC 1848 

547 LLAQILDDGVVPDGLEDDSNVZ 567 

1849 CTTCTGGCTCAGATCCTAGATGATGGCGTAGTGCCAGATGGGTTAGAAGATGACTCTAATGTGTAAAGAGAATGAATCCTATGT 1932 

1933 CGGCACTCGGTGGTAACCCCTCGCGAGAAAGTCGGGATAGGACACTCTCTATCAGAATGGATGTCTTGCTGTCATAACAGATAG 2016 

2017 AGAAGGTTGTGGCAGACCCTGTATCAATTAGTTGAAAGAGATTGCAAAATAGAGAATGTGTGAGAGAAGTTAGCAAGGTCCTAC 2100 

2101 GTCTAACCATAAGAACGGCGATAGGCGCCCCCTGGGAAGAGCTCACATCAGGGTACTATTCCTGCAATGCCCTAGTAAATGAAT 2184 

2185 GAAGTTGATCATGGCCAATTGGAAGAATCAC-poly (A)-3' 2215 

Fig. 3. DNA sequence and deduced amino acid sequence of the DIssE cDNA clones. The extreme 5’ end sequence was obtained by primer- 
extension studies (see Fig. 4). A translation of the main ORF is shown in single-letter amino acid code. Solid triangles indicate the sites where 
sequence fusion occurred. 
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10 20 30 40 50 

UAUAAGAGUGAOUGGCGUCCGUACGUACCCUCUCUACUCUAAAACUCUUG 

UAUAAGAGUGAAUGGCGUCCGUACGUACCCAAUCUACUCUAAAACUCUUG 

UAUAAGAGUGAAUGGCGUCCGUACGUACCCAAUCUACUCUAAAACUCUUG 


60 7 0 a 80 b 90 100 

UAGUUUAAA UCUAAUCUAAUCUAA ACUUUAUAAACGGCACUUCCUGCGUG 

1 2 3 

UAGOOOAAA UCUAAUCUAAUCUAAUCUAA ACGGCACUUCCUGCGUGOCCA 
12 3 4 

UAGUUUAAA UCUAAUCUAAUCDAA ACGGCACUUCCUGCGUGUCCAUGCCC 

1 2 3 


110 120 130 140 

UCCAUGCCCGUGGGCCUGGUCUUGUCAUAGUGCUGACAUU 

UGCCCGOGGGCCUGGUCUUGUCAUAGUGCUGACAUO 

GUGGGCCUGGUCUUGUCAUAGUGCUGACAUU 

Fig. 4. Primer extension analysis of the 5'-end of DIssE. (A) The synthetic oligodeoxyribonucleotides (18-mer) complementary to the nucleo¬ 
tides 123-140 from the 5' end of the parental MHV-JHM genomic RNA (Shieh et al., 1987; Soe et a!., 1987) was 32 P-labeled at the 5' end, 
hybridized to the gel-purified DIssE RNA, and extended with reverse transcriptase. The products were electrophoresed on 6% polyacrylamide 
gels containing 8 M urea. O, origin of the gel. Two primer-extended products are shown as a and b. (B) The DNA sequences of these primer- 
extended products were determined by the Maxam-Gilbert method. The 5'-end sequence of MHV-JHM genomic sequence was obtained from 
previous studies (Shieh et at., 1987; Soe et al., 1987). The letters a and b represent the canonical seven-nucleotide sequence UCUAAAC and 
imperfectly repeated sequence of UAUAAAC, respectively. A bold solid line represents the nine-nucleotide sequence which is deleted in DIssE 
but present in MHV-JHM. DIssE (a) and DIssE (b) correspond to the sequences of primer-extended products, a and b, in Fig. 4A, respectively. 
Three base substitutions are indicated by asterisks. 
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Dl RNAs. We therefore examined whether any signifi¬ 
cant secondary structure existed at rearrangement 
sites on MHV-JHM genomic RNA. The nucleotide se¬ 
quences surrounding deleted regions of MHV-JHM ge¬ 
nomic RNA were analyzed by an RNA secondary struc¬ 
ture program of Zuker and Stiegler (1981). The pre¬ 
dicted secondary structures of these rearrangement 
regions are shown in Fig. 6. All four genomic deletion 
sites have extensive and stable secondary structures. 
The free energies of these structures range from -73.0 
to -114.2 kcal/mol. Furthermore, as previously de¬ 
scribed for the standard MHV-JHM, the sequence sur¬ 
rounding the junction of leader RNA and the remaining 
5'-end genomic sequence also contains a stable sec¬ 
ondary structure (Soe et al., 1987). This junction region 
includes the nine-nucleotide deletion detected in 
DIssE RNA (Fig. 4B). Thus, an extensive and stable 
secondary structure exists at each parental MHV-JHM 
genomic region where deletion occurred. 


DISCUSSION 

The present study demonstrated that the smallest 
Dl-specific RNA, DIssE, is composed of three discon¬ 
tiguous parts of the viral genome, including the 5' end 
and 3' end of genomic RNA. This structure is similar to 
many Dl RNAs of other viruses, which typically retain 
both ends of the standard nondefective viral RNAs. Our 
previous study has demonstrated that DIssE is repli¬ 
cated from its negative template in the presence of 
helper virus (Makino et al., 1988). Therefore, the DIssE 
sequence likely contains essential recognition signals 
for MHV RNA replication. The structure of DIssE RNA 
supports the likelihood that the recognition signals for 
the synthesis of negative-strand RNA and positive- 
strand RNA are localized at the 3' end and 5' end of 
genomic RNA, respectively. 

One of the unique features of coronavirus Dl RNA is 
that subgenomic Dl RNA was poorly incorporated into 
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Fig. 5. Translation of DIssE-specific proteins. (A) Translation in a rabbit reticulocyte lysate of gel-purified DIssE RNA. 35 S-Labeled in vitro 
translation products of DIssE were analyzed by SDS-polyacrylamide gel electrophoresis directly (lane 1), and immunoprecipitated with anti-N 
protein monoclonal antibody (lane 2) or anti-p28 antibody (lane 3). Lane 4 contains 14 C-labeled marker proteins. (B) DIssE-specific proteins in 
Dl-infected cells. DBT cells were mock-infected (lanes 1 and 4), infected with MHV-JHM (lanes 2 and 5), or infected with MHV-JHM containing 
Dl particles (Lanes 3 and 6). At 7.5 hr postinfection, cultures were labeled with [ 36 S]methionine for 30 min, and cytoplasmic lysates were 
prepared, immunoprecipitated with anti-N protein monoclonal antibody (lanes 1 -3) or anti-p28 antibody (lanes 4-6), and electrophoresed. Lane 
7 contains ' 4 C-labeled marker proteins. 


L A BCDEFG 



Fig. 6. Predicted secondary structure at the sequence rearrangement sites of MHV-JHM genomic RNA. The sequence of MHV-JHM genomic 
RNA was obtained from previously published data (Soe et at.. 1987) and our unpublished data (Shieh et at., unpublished data). A-G represent 
the seven genes of MHV RNA. Solid boxes correspond to regions which share with DIssE. Free energy of the secondary structure at each 
rearranged site is given in kilocalories per mole. Arrows indicate the rearrangement sites. 
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virus particles (Makino etal., 1988). One of the possible 
explanations is that the Dl subgenomic RNAs lack a 
packaging signal. Since all MHV-specific subgenomic 
mRNAs contain the leader sequence, yet only geno¬ 
mic-sized RNA is efficiently packaged into virus parti¬ 
cles, the packaging signal is probably located in gene 
A but not in leader sequence. The present study indi¬ 
cates that DIssE RNA has a nine-nucleotide (UUUAU- 
AAAC) deletion at the junction between the leader RNA 
and the remaining genomic RNA sequence. However, 
this deletion is not likely to account for the failure of 
efficient Dl RNA packaging into virus particles since 
DIssA and the genomic RNA of a mutant MHV-JHM, 
both of which are packaged into virus particles, also 
have similar nine-nucleotide deletions (S. Makino, un¬ 
published data). Thus, the packaging signals may be 
localized downstream of the 5' end 864 nucleotides. 
Recently we found that another intracellular Dl-specific 
RNA, DIssF, could be packaged more efficiently than 
DIssE (S. Makino, unpublished data). The DIssF RNA 
is approximately 1.7 kb larger than DIssE and appears 
to contain more gene A sequences than DIssE, as de¬ 
termined from T1-oligonucleotide fingerprinting (Ma¬ 
kino et at., 1985). Sequence analysis of DIssF may re¬ 
veal the possible reason for the poor incorporation of 
DIssE RNA into virus particles. 

The data presented in this paper demonstrate exten¬ 
sive and stable secondary structures in the standard 
viral RNA at sites where Dl RNA underwent deletions. 
This observation is consistent with a model of Dl RNA 
generation, in which RNA transcription is interrupted 
at sites of hairpin loops on the template, and the RNA 
intermediates then fall off and rebind at new sites on 
the template to generate an RNA with extensive dele¬ 
tions. We have previously suggested that coronavirus 
RNA synthesis may utilize a discontinuous, nonproces- 
sive mechanism, in which RNA transcription pauses at 
sites of secondary structures (Baric et a/., 1987). The 
incomplete RNA intermediates dissociate from tem¬ 
plates and then rejoin the temple for subsequent RNA 
transcription. This mechanism is supported by the 
findings that MHV can undergo RNA recombination at 
an extremely high frequency (Makino et at., 1986a), 
and that free incomplete RNA transcription products of 
various sizes were detectable in the cytoplasm of 
MHV-infected cells (Baric etal., 1985, 1987). Further¬ 
more, the sizes of these RNA products correspond to 
the lengths between the 5' end and the sites of hairpin 
loops (Baric et at., 1987), in agreement with the notion 
that transcription pauses at these hairpin loops. Thus, 
the potential hairpin loops present in the genomic RNA 
at the Dl RNA rearrangement sites could have inter¬ 
rupted RNA transcription. The incomplete RNA tran¬ 


script may join the RNA template at the downstream 
rearrangement sites and create deleted RNA as a re¬ 
sult. However, there is no consensus sequence at the 
sites of RNA deletion and reinitiation. It is not known 
how the reinitiation of RNA synthesis occurred. 

The deletion of the nine nucleotides (UUUAUAAAC) 
at the 5' end where the leader RNA joins the genomic 
RNA may have been caused by the same discontinu¬ 
ous and nonprocessive transcription mechanism. It is 
interesting to note that the UCUAAAC, which is the 
consensus sequence for the leader RNA binding (Shieh 
et at., 1987), is imperfectly repeated (UAUAAAC) at 
nine nucleotides downstream (Shieh etal., 1987). It is 
these nine nucleotides which were deleted in DIssE 
RNA. Similar nine-nucleotide deletions have also been 
noted in the genomic RNA of DIssA, and that of an 
MHV-JHM mutant virus (S. Makino, unpublished data). 
This RNA structure suggests that RNA synthesis may 
pause at the first repeat, and then reinitiate at the sec¬ 
ond repeat because of the binding of the incomplete 
RNA transcript to the second repeat. Finally, the heter¬ 
ogeneity in the number of UCUAA repeats in Dl RNAs 
also supports the discontinuous nature of coronavirus 
RNA replication. Similar heterogeneity has been noted 
in the genomic RNA of several different MHV strains (S. 
Makino and M, M. C. Lai, manuscript in preparation). 
Thus, Dl RNA may be a product of discontinuous, non¬ 
processive RNA replication of coronaviruses. 

There was a significant difference between the ap¬ 
parent molecular mass of the DIssE-specific protein 
products, 88K and 79K, and the predicted molecular 
mass of the potential product of the large ORF of DIssE 
RNA. This difference could be due to unusual configu¬ 
rations affecting electrophoretic migration, or due to 
the presence of phosphorylation, since the N protein is 
phosphorylated (Stohlman and Lai, 1979) and protein 
translated in vitro could be phosphorylated (Chatto- 
padhyay and Banerjee, 1987). A similar difference be¬ 
tween the predicted and actual molecular mass of the 
N protein has previously been noted (Skinner and Sid- 
dell, 1983). The relationship between the two protein 
species is not clear. The N protein has also been 
shown to consist of multiple species (Robbins et at., 
1986). It is not clear whether these proteins play any 
functional roles in Dl-infected cells. Typically, Dl RNAs 
do not synthesize any protein; however, in the Sindbis 
virus system, translation products have been detected 
from a Dl RNA(Migliaccio etal., 1985). 

Although MHV genomic RNA and DIssE RNA are the 
major RNA species among MHV-specific mRNA spe¬ 
cies in virus-infected cells (Makino era/., 1985, 1988) 
(Fig. 1), the gene products of these two mRNAs, RNA 
polymerase and both the 79K and 88K proteins, were 
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present in small quantities in virus-infected cells (Fig. 
5B). We have previously demonstrated that the pres¬ 
ence of stable secondary structure at the 5' end non¬ 
coding regions of the polymerase gene reduced the 
amount of polymerse protein synthesized in vitro (Soe 
et at., 1987). Also, as discussed previously, the pres¬ 
ence of the small ORF encoding eight amino acids (Fig. 
3) may reduce the number of ribosomes reaching the 
downstream optimal translation site (Soe et al., 1987). 
Since DIssE RNA has a 5' end structure similar to that 
of genomic RNA, the DIssE RNA may provide a tool to 
better our understanding of the mechanism of transla¬ 
tional control of MFIV RNAs. Furthermore, the fusion 
protein synthesized by DIssE RNA may be useful for 
understanding the functional and structural domains of 
the MFIV polymerase and N protein. 
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